In mathematics, a set of elements of a vector space is called a basis (: bases) if every element of can be written in a unique way as a finite linear combination of elements of . The coefficients of this linear combination are referred to as components or coordinates of the vector with respect to . The elements of a basis are called .
Equivalently, a set is a basis if its elements are linearly independent and every element of is a linear combination of elements of . In other words, a basis is a linearly independent spanning set.
A vector space can have several bases; however all the bases have the same number of elements, called the dimension of the vector space.
This article deals mainly with finite-dimensional vector spaces. However, many of the principles are also valid for infinite-dimensional vector spaces.
Basis vectors find applications in the study of crystal structures and frames of reference.
The scalars are called the coordinates of the vector with respect to the basis , and by the first property they are uniquely determined.
A vector space that has a finite set basis is called finite-dimensional. In this case, the finite subset can be taken as itself to check for linear independence in the above definition.
It is often convenient or even necessary to have an total order on the basis vectors, for example, when discussing orientation, or when one considers the scalar coefficients of a vector with respect to a basis without referring explicitly to the basis elements. In this case, the ordering is necessary for associating each coefficient to the corresponding basis element. This ordering can be done by numbering the basis elements. In order to emphasize that an order has been chosen, one speaks of an ordered basis, which is therefore not simply an unstructured set, but a sequence, an indexed family, or similar; see below.
More generally, if is a field, the set of tuple of elements of is a vector space for similarly defined addition and scalar multiplication. Let be the -tuple with all components equal to 0, except the th, which is 1. Then is a basis of which is called the standard basis of
A different flavor of example is given by . If is a field, the collection of all in one indeterminate with coefficients in is an -vector space. One basis for this space is the monomial basis , consisting of all : Any set of polynomials such that there is exactly one polynomial of each degree (such as the Bernstein basis polynomials or Chebyshev polynomials) is also a basis. (Such a set of polynomials is called a polynomial sequence.) But there are also many bases for that are not of this form.
Most properties resulting from the Steinitz exchange lemma remain true when there is no finite spanning set, but their proofs in the infinite case generally require the axiom of choice or a weaker form of it, such as the ultrafilter lemma.
If is a vector space over a field , then:
If is a vector space of dimension , then:
Let, as usual, be the set of the tuple of elements of . This set is an -vector space, with addition and scalar multiplication defined component-wise. The map is a linear isomorphism from the vector space onto . In other words, is the coordinate space of , and the -tuple is the coordinate vector of .
The inverse image by of is the -tuple all of whose components are 0, except the th that is 1. The form an ordered basis of , which is called its standard basis or canonical basis. The ordered basis is the image by of the canonical basis of
It follows from what precedes that every ordered basis is the image by a linear isomorphism of the canonical basis of and that every linear isomorphism from onto may be defined as the isomorphism that maps the canonical basis of onto a given ordered basis of . In other words, it is equivalent to define an ordered basis of , or a linear isomorphism from onto .
Typically, the new basis vectors are given by their coordinates over the old basis, that is, If and are the coordinates of a vector over the old and the new basis respectively, the change-of-basis formula is for .
This formula may be concisely written in matrix notation. Let be the matrix of the and be the of the coordinates of in the old and the new basis respectively, then the formula for changing coordinates is
The formula can be proven by considering the decomposition of the vector on the two bases: one has and
The change-of-basis formula results then from the uniqueness of the decomposition of a vector over a basis, here that is for .
Like for vector spaces, a basis of a module is a linearly independent subset that is also a generating set. A major difference with the theory of vector spaces is that not every module has a basis. A module that has a basis is called a free module. Free modules play a fundamental role in module theory, as they may be used for describing the structure of non-free modules through .
A module over the integers is exactly the same thing as an abelian group. Thus a free module over the integers is also a free abelian group. Free abelian groups have specific properties that are not shared by modules over other rings. Specifically, every subgroup of a free abelian group is a free abelian group, and, if is a subgroup of a finitely generated free abelian group (that is an abelian group that has a finite basis), then there is a basis of and an integer such that is a basis of , for some nonzero integers For details, see .
The common feature of the other notions is that they permit the taking of infinite linear combinations of the basis vectors in order to generate the space. This, of course, requires that infinite sums are meaningfully defined on these spaces, as is the case for topological vector spaces – a large class of vector spaces including e.g. , , or Fréchet spaces.
The preference of other types of bases for infinite-dimensional spaces is justified by the fact that the Hamel basis becomes "too big" in Banach spaces: If X is an infinite-dimensional normed vector space that is complete space (i.e. X is a Banach space), then any Hamel basis of X is necessarily uncountable. This is a consequence of the Baire category theorem. The completeness as well as infinite dimension are crucial assumptions in the previous claim. Indeed, finite-dimensional spaces have by definition finite bases and there are infinite-dimensional ( non-complete) normed spaces that have countable Hamel bases. Consider the space of the of real numbers that have only finitely many non-zero elements, with the norm Its standard basis, consisting of the sequences having only one non-zero element, which is equal to 1, is a countable Hamel basis.
The functions are linearly independent, and every function f that is square-integrable on 0, is an "infinite linear combination" of them, in the sense that
for suitable (real or complex) coefficients a k, b k. But manyNote that one cannot say "most" because the cardinalities of the two sets (functions that can and cannot be represented with a finite number of basis functions) are the same. square-integrable functions cannot be represented as finite linear combinations of these basis functions, which therefore do not comprise a Hamel basis. Every Hamel basis of this space is much bigger than this merely countably infinite set of functions. Hamel bases of spaces of this kind are typically not useful, whereas orthonormal bases of these spaces are essential in Fourier analysis.
It is difficult to check numerically the linear dependence or exact orthogonality. Therefore, the notion of ε-orthogonality is used. For spaces with inner product, x is ε-orthogonal to y if (that is, cosine of the angle between and is less than ).
In high dimensions, two independent random vectors are with high probability almost orthogonal, and the number of independent random vectors, which all are with given high probability pairwise almost orthogonal, grows exponentially with dimension. More precisely, consider equidistribution in n-dimensional ball. Choose N independent random vectors from a ball (they are independent and identically distributed). Let θ be a small positive number. Then for
random vectors are all pairwise ε-orthogonal with probability . This growth exponentially with dimension and for sufficiently big . This property of random bases is a manifestation of the so-called .
The figure (right) illustrates distribution of lengths N of pairwise almost orthogonal chains of vectors that are independently randomly sampled from the n-dimensional cube as a function of dimension, n. A point is first randomly selected in the cube. The second point is randomly chosen in the same cube. If the angle between the vectors was within then the vector was retained. At the next step a new vector is generated in the same hypercube, and its angles with the previously generated vectors are evaluated. If these angles are within then the vector is retained. The process is repeated until the chain of almost orthogonality breaks, and the number of such pairwise almost orthogonal vectors (length of the chain) is recorded. For each n, 20 pairwise almost orthogonal chains were constructed numerically for each dimension. Distribution of the length of these chains is presented.
The set is nonempty since the empty set is an independent subset of , and it is Partial order by inclusion, which is denoted, as usual, by .
Let be a subset of that is totally ordered by , and let be the union of all the elements of (which are themselves certain subsets of ).
Since is totally ordered, every finite subset of is a subset of an element of , which is a linearly independent subset of , and hence is linearly independent. Thus is an element of . Therefore, is an upper bound for in : it is an element of , that contains every element of .
As is nonempty, and every totally ordered subset of has an upper bound in , Zorn's lemma asserts that has a maximal element. In other words, there exists some element of satisfying the condition that whenever for some element of , then .
It remains to prove that is a basis of . Since belongs to , we already know that is a linearly independent subset of .
If there were some vector of that is not in the span of , then would not be an element of either. Let . This set is an element of , that is, it is a linearly independent subset of (because w is not in the span of , and is independent). As , and (because contains the vector that is not contained in ), this contradicts the maximality of . Thus this shows that spans .
Hence is linearly independent and spans . It is thus a basis of , and this proves that every vector space has a basis.
This proof relies on Zorn's lemma, which is equivalent to the axiom of choice. Conversely, it has been proved that if every vector space has a basis, then the axiom of choice is true. Thus the two assertions are equivalent.
|
|